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RAW SEQUENCE LISTING DATE: 
PATENT APPLICATION: US/09/858 , 332B TIME 

Input Set : D:\SEQLIST.txt 
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55 <211> 

56 <212> 

57 <213> 

59 <220> 

60 <223> 
62 
63 
64 



Grigory S. 

Metal* Ion Affinity Tags and Methods for 



<110> APPLICANT: Tchaga, 

Jokhadze, George 
<120> 'title of invention 

■ Using the Same 
<130> FILE REFERENCE: CLON-056CIP 

<140> CURRENT APPLICATION NUMBER: US 09/858, 332B 
<141> CURRENT FILING DATE: 2001-05-15 
<150> PRIOR APPLICATION NUMBER: 09/404,017 
<151> PRIOR FILING DATE: 1999-09-23 
<150> PRIOR APPLICATION NUMBER: 60/101,867. 
<151> PRIOR FILING DATE: 1998-09-25 
<160> NUMBER OF SEQ ID NOS : 23 

<170> SOFTWARE: FastSEQ for Windows Version 4.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 16. 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: affinity peptide 
<400> SEQUENCE: 1 



RECEIVED 

FEB 1 9 2003 

TECH CENTER 160012900- 



ENTERED 



His 
1 

<210> 
<211> 
<212> 
<213> 
<220> 
<223> 
<400> 



Leu 



His Lys Glu Glu His Ala His Ala His Asn. 

10 15 



He His Asn Val 
5 

SEQ ID NO: 2 
LENGTH: 18 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER .INFORMATION: affinity peptide 
SEQUENCE: 2 

His Asp Asp His Asp Asp His Asp Asp His Asp Asp His Asp Asp His 

1 '5 10 15 

Asp Asp 

<210> SEQ ID NO: 3 
LENGTH: 16 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER .INFORMATION: affinity peptide 
<400> SEQUENCE: 3 

His Glu Glu His Glu Glu His Glu Glu His Glu Glu His Glu Glu His 
15 10 15 



67 .<210> SEQ ID NO: 

68 <211> LENGTH: 18 



fiie:77C:\CRF4\Outhoid\VsfI858332B:h 
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RAW SEQUENCE LISTING DATE: 02/12/2003 

PATENT APPLICATION: US/09/858 , 332B TIME: 10:35:04 

Input Set : D:\SEQLIST.txt 

Output Set: N:\CRF4\02112003\l858332B.raw 

69 <212> TYPE: PRT 

70 "<213> ORGANISM: Artificial Sequence 

72 <220> FEATURE: 

73 <223> OTHER INFORMATION: affinity peptide 

75 <400>- SEQUENCE: 4 

76 His Asp Glu His Asp Glu His Asp Glu His Asp Glu His Asp Glu His 

77 1 5 10 15 

78 Asp Glu 

82 <210> SEQ ID NO: 5 

83 <211> LENGTH: 18 

84 <212> TYPE: PRT 

85 .<213> ORGANISM: Artificial Sequence 

87 <220> FEATURE: 

88 <223> OTHER INFORMATION: affinity peptide 

90 <4 00> SEQUENCE: '5 

91 His Glu Asp His Glu Asp His Glu Asp His Glu Asp His Glu Asp His 

92 1 5 10 15 

93 Glu Asp 

97 <210> SEQ ID NO: 6 

98 <211> LENGTH: 4 

99 <212> TYPE: 'PRT ' 

100 <213> ORGANISM: Artificial Sequence 

102 <220> FEATURE: - 

103 <223> OTHER INFORMATION: enterokinase cleavage site 

105 <400> SEQUENCE: 6 

106 He Glu Gly Arg 

107 1 ' 

110 <210> SEQ ID NO: 7 

111 <211> LENGTH: 6 • 

112 <212> TYPE: PRT 

113 <213> ORGANISM: Artificial Sequence 

115 <220> FEATURE: ' 

116 <223> OTHER INFORMATION: a factor Xa cleavage site 

118 <400> SEQUENCE: 7 . 

119 Leu Val Pro Arg Gly Ser 

120 1 5 

123 <210> SEQ ID NO: 8 

124 <211> LENGTH: 8 

125 <212> TYPE: PRT 

126 <213> ORGANISM: Artificial Sequence 

128 <220> FEATURE: 

129 <223> OTHER INFORMATION: a thrombin cleavage site 

131 <400> SEQUENCE: 8 

132 His Pro Phe His Leu Val He His 

133 1 5 

136 <210> SEQ ID NO: 9 

137 <211> LENGTH: 10 

138 <212> TYPE: PRT 

139 <213> ORGANISM: Artificial Sequence 



, file://C~:VCRF4\OutHoid\VsfI858332Brh"tin """ " 2/12/03 — r 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/858 , 332B 



DATE: 02/12/2003 
TIME: 10:35:04 



Input Set : D:\SEQLIST.txt 

Output Set: N:\CRF4\02112003\l858332B.raw 



141 <220> FEATURE: 

142 <223> OTHER INFORMATION: .a renin cleavage site 

144 <400> SEQUENCE: 9 

145 Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 

146 1 5 10 

149 <210> SEQ ID NO: 10 

150 <211> LENGTH: 8 

151 <212> TYPE: PRT 

152 <213> ORGANISM: Artificial Sequence 

154 <220> FEATURE: 

155 <223> OTHER INFORMATION: an immunological tag 

157 <400> SEQUENCE: 10 , 

158 Asp Tyr Lys Asp Asp Asp Asp Lys 



5 

11 



Artificial Sequence 

an immunological tag 



159 1 

162 <210> SEQ ID NO: 

163 <211> LENGTH: 11 

164 <212> TYPE: ' PRT 

165 <213> ORGANISM: 

167 <220> FEATURE: 

168 <223> OTHER INFORMATION 

170 <4 00> SEQUENCE: 11 

171 Cys Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 

172 1 5 • 10 

175 <210> SEQ ID NO: 12 

176 <211> LENGTH: 11 

177 <212> TYPE: PRT 

178 <2'13> ORGANISM: Artificial Sequence 

180 <220> FEATURE: 

181 <223> OTHER INFORMATION: an immunological tag 

183 <400> SEQUENCE: 12 

184 Cys Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 

185 1 5. 10 

188 <210> SEQ ID NO: 13 

189 <211> LENGTH: 3430 

190 <212> TYPE: DNA 

191 <213> ORGANISM: Artificial Sequence 

193 <220> FEATURE: 

194 <223> OTHER INFORMATION: DNA sequence of vector contai 

195 recombinant enterokinase 

197 <400> SEQUENCE: 13 

198 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata 
199. cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 

200 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 

201 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt 

202 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa 

203 ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac 

204 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt 

205 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt 

206 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat 



ning cDNA of 



ataatggttt 60 
tgtttatttt 120 
atgcttcaat 180 
attccctttt 240 
gtaaaagatg 300 
agcggtaaga 360 
aaagttctgc 420 
cgccgcatac' 480 
cttacggatg 540 



^fii^//C:\CRF4\0^ 
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PATENT APPLICATION: 05/09/858,3326 



DATE: 02/12/2003 
TIME: 10:35:04 



Input Set .: D:\SEQLIST.txt 

Output Set: N:\CRF4\02112003\l858332B.raw 



207. gcatgacagt 

208 acttacttct 

209 gggatcatgt 
. 210 acgagcgtga 

211 gcgaactact 

212 ttgcaggacc 

213 gagccggtga 

214 cccgtatcgt 

215 agatcgctga 

216 catatatact 

217 tcctttttga 

218 cagaccccgt 

219 gctgcttgca 

220 taccaactct 

221 ttctagtgta 

222 tcgctctgct 

223 ggttggactc 

224 cgtgcacaca 

225 agctatgaga 

226 gbgcagggtc 

227 ttatagtcct 

228 aggggggcgg 

229 ttgctggcct 

230 tattaccgcc 

231 gtcagtgagc 

232 gccgattcat 

233 caacgcaatt 

234 tccggctcgt 

235 tgaccatgat 

236 acgctcatgc 

237 ggccttgggt 

238 gcagggattg 

239 agtggaaagc 
24 0 ctaggttgat 

241 acattgccat 

242 gtttaccaga 

243 gggcacttat 

244 tatcaaatga 
24 5 gtgcaggcta 
24 6 tgtgccaaga 

247 cactgcctaa 

248 gttttctaca 
24 9 gagcctggga 

250 ccagctggcg 

251 tgaatggcga 

252 accgcatatg 

253 gacacccgcc 

254 acagacaagc 

255 cgaaacgcgc 



aagagaatta 
gacaacgatc 
aactcgcctt 
caccacgatg 
tactctagct 
acttctgcgc 
gcgtgggtct 
agttatctac 
gataggtgcc 
ttagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
ttttccgaag 
gccgtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
ggaacaggag 
gtcgggtttc 
agcctatgga 
tttgctcaca 
tttgagtgag 
gaggaagcgg 
taatgcagct 
aatgtgagtt 
atgttgtgtg 
tacgccaag.c 
ccacaacaag 
cgttgctctg 
gctggtgtcg 
agtgctaggc 
tgaccaaatt 
gatgcatctt 
agaaaatcaa 
atatcaaggt 
gaaatgtcaa 
tgaagcagga 
aaacaacaga 
tcgcccaggg 
tgagctcgta 
aaaccctggc 
taatagcgaa 
atgg.cgcctg 
gtgcactctc 
aacacccgct 
tgtgaccgtc 



tgcagtgctg 
ggaggaccga 
gatcgttggg 
cctgtagcaa 
tcccggcaac 
tcggcccttc 
cgcggtatca 
acgacgggga 
tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gtaactggct 
ggccaccact 
ccagtggctg 
ttaccggata 
gagcgaacga 
cttcccgaag 
agcgcacgag 
gccacctctg 
aaaacgccag 
tgttctttcc 
ctgataccgc 
aagagcgccc 
ggcacgacag 
agctcactca 
gaattgtgag 
ttgaaggatc 
atcgatattg 
tatttcgacg 
gccgcccact 
ctgcatatgg 
gtcataaacc 
gaaatgaaag 
gtttttcccc 
tctactgcag 
caacagatgc 
ggggtagatt 
tggctcctgg 
gtgtatgccc 
attagctgag 
gttacccaac 
gaggcccgca 
atgcggtatt 
agtacaatct 
gacgcgccct 
tccgggagct 



ccataaccat 
aggagctaac 
aaccg'gagct 
tggcaacaac 
aattaataga 
cggctggctg 
ttgcagcact 
gtcaggcaac 
agcattggta 
atttttaatt 
cttaacgtga 
cttgagatcc 
cagcggtggt 
tcagcagagc 
tcaagaactc 
ctgccagtgg 
aggcgcagcg 
cctacaccga 
ggagaaaggc 
ggagcttcca 
acttgagcgt 
caacgcggcc 
tgcgttatcc 
tcgccgcagc 
aatacgcaaa 
gtttcccgac 
ttaggcaccc 
cggataacaa 
atctcatcca 
tcggaggaag 
atcaacaggt 
gcgtgtacgg 
catcaaatct 
cacactacaa 
tgaactacac 
caggaagaat 
acgtactgca 
cagaatataa 
cttgtcaggg 
ctggcgtgac 
gggtcccaag 
aattcactgg 
ttaatcgcct 
ccgatcgccc 
ttctccttac 
gctctgatgc 
gacgggcttg 
gcatgtgtca 



gagtgataac 
cgcttttttg 
gaatgaagcc 
gttgcgcaaa 
ctggatggag 
gtttattgct 
ggggccagat 
tatggatgaa 
actgtcagac 
taaaaggatc 
gttttcgttc 
tttttttctg 
ttgtttgccg 
gcagatacca 
tgtagcaccg 
cgataagtcg 
gtcgggctga 
actgagatac 
ggacaggtat 
gggggaaacg 
cgatttttgt 
tttttacggt 
cctgattctg 
cgaacgaccg 
ccgcctctcc 
tggaaagcgg 
caggctttac 
tttcacacag 
caatgtccac 
tgactecaga 
ctgcggagct 
gagaaatatg 
gacttctcct 
taaacggaga 
agattatata 
ttgttctatt 
agaagctgac 
cattacggaa 
ggattcaggc 
gtcatttgga 
gttcacagag 
ccgtcgtttt 
tgcagcacat 
ttcccaacag 
gcatctgtgc 
cgcatagtta 
tctgctcccg 
gaggttttca 



actgcggcca 
cacaacatgg 
ataccaaacg 
ctattaactg 
gcggataaag 
gataaatctg 
ggtaagccct 
cgaaatagac 
caagtttact 
taggtgaaga 
cactgagcgt 
cgcgtaatct 
gatcaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggtt 
ctacagcgtg 
ccggtaagcg 
cctggtatct 
gatgctcgtc 
tcctggcctt 
tggataaccg 
agcgcagcga 
ccgcgcgttg 
gcagtgagcg 
actttatgct 
gaaacagcta 
aaagaggagc 
gaaggagcct 
tctctggtga 
gagccgtcta 
cagatagaaa 
aagaacaatg 
cagcctattt 
getggctggg 
gttccccttc 
aatatggtgt 
ggaccactca 
tatcaatgtg 
tggatacaaa 
acaacgtcgt 
ccccctttcg 
ttgcgcagcc 
ggtatttcac 
agccagcccc 
gcatccgctt 
ccgtcatcac 



600 
660 
720 
780 
84 0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
-2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3430 



£i lZ/k) j 
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RAW SEQUENCE LISTING DATE: 02/12/2003 

PATENT APPLICATION: US/09/858 , 332B TIME: 10:35:04 

Input Set : D:\SEQLIST.txt 

Output Set: N:\CRF4\02112003\l858332B.raw 

257 <210> SEQ ID NO:- 14 

258 <211> LENGTH: 279 

259 <212> TYPE: PRT 

260 <213> ORGANISM: Artificial Sequence 

262 <220> FEATURE: 

263 <223> OTHER INFORMATION: protein sequence of vector containing cDNA of 

264 recombinant enterokinase 
266 <400> SEQUENCE: 14 



9 £7 
Z O / 


l v JcL 


i nr 


lYiei. 


Tin 

lie 


i nr 


riO 


oer 


Leu 


Lys 


7\on 
HSp 


nis 


Leu 


Tl A 

lie 


nis 


7\ o fi 

Hsn 


Vdl 


9 £fi 
Z DO 


1 








0 










i n 
1 u 










1 D 




9 £ Q 

z o y 


His 


Lys 


Pin 
olU 


Pi n 
o±U 


HIS 


Hid 


nlS 


a i a 

Hid 


U -i o 

nlS 


Asn 


Lys 


Tlo 

i ie 


Asp 


Tie 

lie 


v a i 


ui y 


9*7 n 

Z 1 U 








9 n 
z u 










Z 0 










^n 






97 1 
Z / 1 


Gly 


Ser 


Asp 


Ser 


Arg 


Pin 

ulU 


p 1 \7 


hi a 


Trp 


Pro 


Trp 


v a J. 


v ax 


hi a 


Leu 


lyr 


97 9 
Z i Z 
















a n 










A R 








9 7 ^ 
Z 1 3 


Phe 


Asp 


Asp 


PI r\ 


tain 


vai 


Cys 


i^iy 


Ala 
Hid 


Car 

oer 


Leu 


v ai 


oer 


Arg 


Asp 


irp 


01 A 

Z 1 4 




50 




















o u 










97 t; 
Z J O 


Leu 


Val 


oer 


ai 3 
nxa 


7\ I -3 

hi a 


nlS 


uys 


vdl 


lyr 


pi u 
biy 


Arg 


Asn 


l v lct 


Pin 


ir r o 


oer 


97 6 
z / o 


65 










7 n 










7 R 










n 

o u 


97 7 
Z / / 


Lys 


Trp 


Lys 


7\1 2 

HI d 


Vdl 


Leu 


pi w 


Leu 


nlS 


i v iet. 


Hid 


oer 


Asn 


Leu 


l nr 


oer 


97 fi 
Z / 0 










o D 










qn 










-7 o 




97 Q 
Z / 


Pro 


Gin 


Tin 

i ie 


n n 

bill 


i nr 


TV vj-t 

hi g 


Leu 


lie 


A on 
Hbp 


Pin 
bill 


Tl o 

lie 


vdl 


Tl A 

lie 


7\ on 


XT Jl O 


rll b 


z o u 








1UU 










1U J 










1 1 u . 






9R 1 
Z O 1 


Tyr 


Asn 


Lys 


Arg 


Arg 


Lys 


Asn 


7A on 


7A on 
HSp 


Tin 

ne 


Hid 


i v ie u 


Met- 
ric L 


Hie 

hi s 


Leu 


PI n 


282 






115 










120 










125 








283 


Met 


Lys 


Val 


Ash 


Tyr 


Thr 


Asp 


Tyr 


He 


Gin 


Pro 


He 


Cys 


Leu 


Pro 


Glu 


284 




130 










135 










140 










285 


Glu 


Asn 


Gin 


Val 


Phe 


Pro 


Pro 


Gly 


Arg 


He 


Cys 


Ser 


He 


Ala 


Gly 


Trp 


28 6 


145 










150 










155 










160 


287 


Gly Ala 


Leu 


lie 


Tyr 


Gin 


Gly 


Ser 


Thr 


Ala 


Asp 


Val 


Leu 


Gin 


Glu 


Ala 


288 










165 










170 










175 




289 


Asp 


Val 


Pro 


Leu, 


Leu 


Ser 


Asn 


Glu 


Lys 


Cys 


Gin 


Gin 


Gin 


Met 


Pro 


Glu 


290 








180 










185 










190 






291 


Tyr 


Asn 


He 


Thr 


Glu 


Asn 


Met 


Val 


Cys 


Ala 


Gly 


Tyr 


Glu 


Ala 


Gly 


Gly 


292 






195 










200 










205 








293 


Val 


Asp 


Ser 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


; Pro 


Leu 


Met 


Cys 


Gin 


Glu 


294 




210 










215 










220 










295 


Asn 


Asn 


Arg 


Trp 


Leu 


Leu 


Ala 


Gly 


Val 


Thr 


Ser 


Phe 


Gly 


Tyr 


Gin 


Cys 


296 


225 










230 










235 










240 


297 


Ala 


Leu 


Pro 


Asn 


Arg 


Pro 


Gly 


Val 


Tyr 


Ala 


Arg 


Val 


Pro 


Arg 


Phe 


Thr 


298 










245 










250 










255 




299 


Glu 


Trp 


He 


Gin 


Ser 


Phe 


Leu 


His 


Glu 


Leu 


Val 


lie 


Ser 


Glu 


Phe 


Thr 


300 








260 










265 










270 






301 


Gly Arg 


Arg 


Phe 


Thr 


Thr 


Ser 





















302 275 

305 <210> SEQ ID NO: 15 

306 <211> LENGTH: 12 

307 <212> TYPE: PRT 

308 <213> ORGANISM: Artificial Sequence 
310 <220> FEATURE: 



"fiieT//C^ --- -2/T2/G3" — 
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RAW SEQUENCE LISTING ERROR SUMMARY 

PATENT APPLICATION: US/09/858 , 332B 



DATE: 02/12/2003 
TIME: 10:35:05 



Input Set : D:\SEQLIST.txt 

Output Set: N:\CRF4\02112003\l858332B.raw 



Please Note: 

Use of n and/ or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:21; Xaa Pos . 2,3,5,6,7,9 
Seq#:23; Xaa Pos. 2,3,5,6,8,9 



■fne://C:\CKh^4\0utfi6ia\VOTl8'5833'2B:hfln 



z/ rz/ kjj 



